Toward Improving b-Coloring based Clustering using a Greedy re-Coloring Algorithm
نویسندگان
چکیده
Clustering is an important task in the process of data analysis which can be viewed as a data modeling technique that provides an attractive mechanism to automatically find the hidden structure of large data sets (Jain et al., 1999). Informally, this task consists of the division of data items (objects, instances, etc.) into groups or categories, such that all objects in the same group are similar to each other, while dissimilar from objects in the other groups. Clustering plays an important role in data mining applications such as Web analysis, information retrieval, medical diagnosis, and many other domains. Recently, we have proposed a clustering method based on the concept of b-coloring of a graph (Irving & Manlov, 1999). A graph b-coloring is an assignment of colors to the vertices of the graph such that: i. no two adjacent vertices (vertices joined by an weighted edge representing the dissimilarity between objects) have the same color (proper coloring) ii. for each color, there exists at least one vertex which is adjacent (has a sufficient dissimilarity degree) to all other colors. This vertex is called a dominating vertex; there can be many within the same class. Both (i) and (ii) are the constraints in b-coloring of a graph. The b-coloring based clustering method enables to build a fine partition of the dataset into clusters even when the number of clusters is not specified in advance. The previous clustering algorithm in (Elghazel et al., 2006) conducts the following two steps in greedy fashion: 1. initalizes the colors of vertices so that the colors satisfy proper coloring, and 2. removes, by a greedy procedure, the colors that have no dominating vertices, until each color has at least one dominating vertex. These steps correspond to the above two constraints in b-coloring. Although it returns a bcoloring of a graph, it does not explicitly consider the quality of the clusters in the algorithm. Thus, besides satisfying the above constraints, it was difficult to explicitly generate better clusters of the given data items. O pe n A cc es s D at ab as e w w w .in te ch w eb .o rg
منابع مشابه
الگوریتم ژنتیک با جهش آشوبی هوشمند و ترکیب چندنقطهای مکاشفهای برای حل مسئله رنگآمیزی گراف
Graph coloring is a way of coloring the vertices of a graph such that no two adjacent vertices have the same color. Graph coloring problem (GCP) is about finding the smallest number of colors needed to color a given graph. The smallest number of colors needed to color a graph G, is called its chromatic number. GCP is a well-known NP-hard problems and, therefore, heuristic algorithms are usually...
متن کاملTabu Search for Graph Coloring, T-colorings and Set T-colorings
In this paper, a generic tabu search is presented for three coloring problems: graph coloring, T-colorings and set T-colorings. This algorithm integrates important features such as greedy initialization, solution re-generation, dynamic tabu tenure, incremental evaluation of solutions and constraint handling techniques. Empirical comparisons show that this algorithm approaches the best coloring ...
متن کاملSta Scheduling: A Simple Approach that Worked
This paper describes our experiences in solving a real staa scheduling problem. We experimented with a variety of approaches that included: studying the structure of the problem as list-coloring in interval graphs, using constraint-based methods, using greedy algorithms, and using a variety of iterative improvement approaches. In the end, a very simple randomized greedy algorithm proved able to...
متن کاملA Graph b-coloring Based Scheme for Composition-Oriented Web Services Abstraction: COWSA
We propose in this paper a self-learning scheme named COWSA, which aims to enhance the performances of existing Web services composition algorithms. Our scheme is based on a new dynamic clustering of Web services, that is oriented to Web services composition. This clustering is performed through using the b-coloring of graphs. We conduct a series of experiments to evaluate the contribution and ...
متن کاملA practical algorithm for [r, s, t]-coloring of graph
Coloring graphs is one of important and frequently used topics in diverse sciences. In the majority of the articles, it is intended to find a proper bound for vertex coloring, edge coloring or total coloring in the graph. Although it is important to find a proper algorithm for graph coloring, it is hard and time-consuming too. In this paper, a new algorithm for vertex coloring, edge coloring an...
متن کامل